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We combine searches by the CDF and DO collaborations for a Higgs boson decaying to W + W~ . The 
data correspond to an integrated total luminosity of 4.8 (CDF) and 5.4 (DO) fb _1 of pp collisions 
at -Js — 1.96 TeV at the Fermilab Tevatron collider. No excess is observed above background 
expectation, and resulting limits on Higgs boson production exclude a standard-model Higgs boson 
in the mass range 162-166 GeV at the 95% C.L. 

PACS numbers: 13.85.Rm, 14.80.Bn 



Finding the last unobserved fundamental particle in 
the standard model (SM), the Higgs boson, is a major 
goal of particle physics, and the search for its existence 
is a central component of Fcrmilab's Tevatron program. 
Direct searches at the CERN LEP collider have set a limit 
on the Higgs boson mass of rriH> 114.4 GeV at the 95% 
C.L. [l[. Combining this limit with precision electroweak 
measurements constrains the mass of the SM Higgs boson 
to be less than 186 GeV at the 95% C.L. 0. The favored 
mass range therefore places the SM Higgs boson within 
the reach of the experiments at the Fermilab Tevatron 
collider. 

In this Letter, we combine searches for Higgs bosons 
(H) decaying to W + W~ performed by the CDF and DO 
Collaborations 0,3]. These searches are particularly sen- 
sitive to a Higgs boson with mass 130<m#<200 GeV. 
The data analyzed correspond to integrated luminosities 
of 4.8 fb- 1 and 5.4 fib" 1 collected with the CDF and DO 
detectors, respectively. We use all significant production 
modes, namely, gluon-gluon fusion (gg^-H), associated 



* Deceased 



production (qq-^-WH or ZH), and vector boson fusion 
(qq^qqH , where the quarks radiate weak gauge bosons 
that fuse to form the H, and is referred to as VBF). 

The event selections used in the CDF and DO analyses 
are similar. Both collaborations select events with large 
missing transverse energy and two oppositely charged, 
isolated leptons, targeting the H^-W + W~ signal in 
which both W bosons decay leptonically. The DO selec- 
tion classifies events in three channels, e + e~, e fj/^, and 
The CDF selection separates opposite-sign di- 
lcpton candidate events into five non-overlapping chan- 
nels, classifying events by their jet multiplicity (0, 1, or 
> 2); the 0- and 1-jet channels arc further divided de- 
pending on whether one or both leptons are in the central 
part of the detector. In addition, CDF searches for Higgs 
boson events containing same-sign lepton pairs, mainly 
produced in WH and ZH associated production. 

The presence of neutrinos in the final state prevents 
full reconstruction of the Higgs boson mass. Other vari- 
ables are used to search for a signal in the presence of 
appreciable background. For example, the azimuthal an- 
gle between the leptons in signal events is smaller on 
average than that in background events due to the scalar 
nature of the Higgs boson and the parity violation in W ± 
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decays. The missing transverse energy is larger, and the 
total transverse energy of the jets is smaller, in signal 
events than in background events. The final discrimi- 
nants are binned neural-network outputs based on sev- 
eral kinematic input variables 0, 0] . A dedicated network 
is trained for each Higgs boson mass tested. For CDF, 
the inputs include likelihoods constructed from matrix- 
element probabilities. Compared with earlier Tevatron 
H^>W + W~ analyses, the new analyses use larger data 
samples, include all significant signal production mecha- 
nisms, and have undergone additional improvements in 
search sensitivity. 

The Higgs boson signals are simulated with pythia [j| , 
using CTEQ5L @ (CDF) and CTEQ6L1 Q (DO) par- 
ton distribution functions (PDF) at leading order (LO). 
We normalize our predictions for the Higgs boson sig- 
nals to the most recent higher-order perturbative QCD 
calculations available. References H, 0] and references 
therein provide the steps used to calculate the gg^H 
cross section. The MSTW 2008 next-to-next-to-leading 
order (NNLO) PDF set is used to predict the gg^H 
production cross section. The calculations of associ- 
ated production and VBF cross sections are described 
in Refs. [Ill4l3j ]. The branching fractions for the Higgs 
boson decays are obtained from hdecay 14( . After all se- 
lections, the total number of expected Higgs boson events 
is approximately 30 per experiment for to#=165 GeV, 
which corresponds to the region of greatest sensitivity. 

Both experiments determine the multijet background 
by studying control samples, which are then extrapo- 
lated into the signal regions. For CDF, backgrounds from 
SM WW, WZ, ZZ, Wj, Drell-Yan, and ^^production 
are generated using the pythia, mc@NLO [l5|], and the 
UB/eb [l(| programs. Backgrounds from W+jets pro- 
cesses, including (for CDF) scmileptonic diboson events, 
single top, and semileptonic ti events, are modeled us- 
ing VF+jets data events and a measurement of the 
rate at which jets are misidentificd as leptons. For 
DO, these backgrounds are generated using pythia, 

with PYTHIA providing 



17(, and COMPHEP 



ALPGEN 

parton-showering and hadronization for all generators. 

The diboson backgrounds are normalized using next- 
to-leading order (NLO) calculations from mcfm 
The ti and sin gle t op production cross sections are taken 
from Refs Jp,[IH and Ref. [H respectively. NNLO cal- 
culations [23J are used by both the CDF and DO Collabo- 
rations for the Drell-Yan background, and by DO for the 
inclusive W/Z processes. Other backgrounds are nor- 
malized to experimental data. Both collaborations use 
NLO simulations and data control samples to improve 
the modeling of differential distributions. Systematic un- 
certainties on the rates of the expected signal and the 
expected backgrounds, as well as on the shapes of the 
final discriminants, are included. More details are given 
in Refs. [1,13. 

We perform the combination twice, using Baycsian and 



modified frcqucntist approaches in turn. We check the 
consistency of the results to verify that the final result 
does not depend on the details of the statistical formula- 
tion. Both combinations test signal mass hypotheses in 
5 GeV steps for values of ran between 130 and 200 GeV, 
i.e., the mass range in which H^-W + W~ is the domi- 
nant decay mode. These two combinations give similar 
results (the limits agree within 5%). Both methods use 
the distributions of the final discriminants, and not just 
the total event counts passing selection requirements. 

Both statistical procedures form, for a given Higgs bo- 
son mass, a combined likelihood (including priors on sys- 
tematic uncertainties, tt(9)) based on the product of like- 
lihoods for the individual channels, each of which in turn 
is a product over histogram bins: 



riij. 



n 



I e 

k=l 



fe/2 



Nc N binB 

C(R,s,b\n,9)xn(9) = U J] ^ 
i=i 3 =i 

(1) 

where the first product is over the number of chan- 
nels {Nc), and the second product is over histogram 
bins containing riy events, binned in ranges of the fi- 
nal neural-network discriminants used for the individ- 
ual analyses. The predictions for the bin contents are 
jjL-ij = R x Sij{9) + bij(9) for channel i and histogram 
bin j, where Sy and b%j represent the expected SM sig- 
nal and background in the bin, and R is a scaling factor 
applied to the signal. By scaling all signal contributions 
by the same factor we make the assumption that the rel- 
ative contributions of the different processes at each m# 
are as given by the SM. Systematic uncertainties are pa- 
rameterized by the dependence of Sij and bij on 9. Each 
of the n sys components of 9, 9k, corresponds to a single 
independent source of systematic uncertainty scaled by 
its standard deviation, and each parameter may have an 
impact on several sources of signal and background in 
different channels, thus accounting for correlations. 

In the Bayesian method we assume a uniform prior in 
the signal yield. Gaussian priors arc assumed for the 9k , 
truncated so that no prediction is negative. The posterior 
density function is then integrated over the 9k (including 
correlations) and a 95% C.L. upper limit on R, Ru m , 
satisfies 



f Rlim / C(R, s, b\n, 9)Tr(9)d9dR 
r o °° / C(R, s, b\n, 9)ir(9)d9dR 



0.95. 



(2) 



The modified frcqucntist technique uses the statisti- 
cal variable CL S , defined in Ref. [24j, to test hypotheses 
which correspond to the presence or absence of Higgs 
boson signals. The test statistic is the log-likelihood 
ratio LLR = -2 In ^Ita^ . wher e p(data|s + b) and 
p(data|b) are the probabilities that the data are drawn 
from the s+b and b-only hypotheses respectively. The 
probabilities p are computed using the best-fit values of 
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the parameters 6k, separately for each of the two hy- 
potheses [25| . The use of these fits extends the proce- 
dure used at LEP 24[, improving the sensitivity when 
the expected signals are small and the uncertainties 
on the backgrounds are large. Two p- values are com- 
puted: CL b = p(LLR> LLR obs |b) and CL s+b = p(LLR> 
LLR bs|s + b), where LLR Q b s is the value of the test 
statistic computed for the data. The ratio CL S = 
CL s+ b/CLb is used to define confidence intervals and is 
chosen to reduce the potential for excluding a signal for 
which there is insufficient sensitivity. If CL S <0.05 for a 
particular choice of s+b, that hypothesis is excluded at 
the 95% C.L. Systematic uncertainties are included by 
fluctuating the predictions for Sij and 6^ when generat- 
ing the pseudoexperiments used to compute CL s+ b and 
CL b . 

Though many sources of systematic uncertainty differ 
between the experiments and analyses, all appropriate 
correlations are taken into account in the combined lim- 
its. The dominant systematic uncertainties arise from 
cross section calculations for the signals and the back- 
grounds; these are correlated between the experiments. 
Variations of the parton distribution functions and the 
rcnormahzation and factorization scales give rise to un- 
certainties of 11% for the gluon-gluon fusion process [2(| . 
5% for associated WH and ZH production [Til, EH , and 
CDF, which uses analyses sepa- 



partially correlated between the experiments, via the 4% 
uncertainty on the inelastic pp cross section 28(. Ad- 



10% for VBF 11, 1 



rated in jet multiplicity bins, applies a channel (jet bin) 
dependent uncertainty of 7% to 70% and a gluon PDF 
uncertainty of 8% on gg^H, following the treatment 
discussed in Ref. [2f|. For the gluon-gluon fusion sig- 
nal process, we study the effects on the acceptance and 
the kinematics of scale variations, gluon PDF variations, 
and the differences between next-to-next-to-leading log 
calculations and the generators used for the central pre- 



dictions, using the fehip and hnnlo programs [8l. 127 1. 
Wc find additional uncertainties of 5% to 10%. The pri- 
mary background, W + W~ production, has a cross sec- 
tion uncertainty of 7% and a similar study of the ac- 
ceptance and kinematics finds additional uncertainties of 
approximately 1% to 5%. The systematic uncertainties 
on WZ, ZZ, it, single top production, and Drell-Yan 
production range from 7% to 10%. The uncertainties 
on the multijet background are uncorrelated between the 
experiments and range from 2% to 15%. The uncertain- 
ties on the yields of W+jets and W7(+jets) range from 
7% to 30%, but these have small effects on the results be- 
cause the rates of these backgrounds are low. Because the 
methods of estimating the W+ jets and Wj(+jets) back- 
grounds differ between CDF and DO, we assume there is 
no correlation between these rates. The uncertainties on 
the lepton identification and the trigger efficiencies are 
uncorrelated between the experiments and range from 
2% to 6%. The uncertainty on the integrated luminos- 
ity of 6% is taken to be correlated between the signal 
and the Monte-Carlo-based background predictions, and 



ditional details related to the treatment of systematic 
uncertainties arc given in Refs. 0, H[. As bin by bin 
uncertainties arising from the statistical uncertainty in 
the Monte Carlo (and in some cases data) samples were 
shown to affect the observed and expected limits by less 
than 1%, they are neglected. 

To better visualize the impact of the data events, we 
combine the histograms of the final discriminants, adding 
the contents of bins with similar s/b ratios, so as not to 
dilute the impact of highly-sensitive bins with those with 
less discriminating power. Figure [1] shows the signal ex- 
pectation and the data with the background subtracted, 
as a function of the s/b ratio of the collected bins. The 
background model has been fit to the data, and the uncer- 
tainties on the background are those after the systematic 
uncertainties have been constrained by the fit. No excess 
of candidate events in the highest s/b bins relative to the 
background expectation is observed. 



CDF + DO Run II 

L=4.8-5.4 fb" 1 



Data-Background 
Signal 

±1 s.d. on Background " 




-1 -0.5 
log 10 (s/b) 

FIG. 1: (color online). Background-subtracted data distribu- 
tions for the discriminant histograms, summed for bins with 
similar s/b, for the mn — 165 GeV combined search. The 
background has been fit to the data under the b-only hypoth- 
esis, and the uncertainty on the background is the post-fit 
systematic uncertainty. The signal, which is normalized to 
the SM expectation, is shown with a filled histogram. The un- 
certainties shown on the background-subtracted data points 
are the square roots of the post-fit background predictions in 
each bin, representing the expected statistical uncertainty on 
the data. 

Before extracting the combined limits, we study the 
LLR distributions for the s+b and b-only hypotheses, 
shown in Fig. [2] as functions of mu- The separation 
between the median LLRb and LLR s+ b divided by the 
widths is a measure of the discriminating power of the 
search. The value of LLR Q b s relative to the expected s+b 
and b-only distributions indicates whether the observed 
data are more consistent with the presence of signal, or 
not. No significant excess of data above the background 
expectation is seen for any value of mjj. Because the 
same data events are used to construct the observed LLR 
at each mn tested, the LLR values are highly correlated 
from one mjj to the next. This also applies to Figs. [3] 
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and [4] described below. 
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FIG. 2: (color online). Distributions of LLR as functions of 
the Higgs boson mass. We display the median values of the 
LbR distribution for the b-only hypothesis (LLRb), the s+b 
hypothesis (LLR s +b), and for the data (LLR b s ). The shaded 
bands indicate the 68% and 95% probability regions in which 
the LLR is expected to fluctuate, in the absence of signal. 

We extract limits on SM Higgs boson production in 
pp collisions at y/s = 1.96 TcV in the m H = 130-200 GeV 
mass range. We present our results in terms of i?n m , the 
ratio of the limits obtained to the rate predicted by the 
SM, as a function of the Higgs boson mass. We assume 
the production fractions for WH, ZH, gg—>H, and VBF, 
and the Higgs boson decay branching fractions, are those 
predicted by the SM. A value of i?u m less than or equal 
to one indicates a Higgs boson mass that is excluded at 
the 95% C.L. 

The ratios of the expected and observed limits to the 
SM cross section are shown in Fig. [3] as a function of m# ■ 
The observed and median expected ratios arc listed in Ta- 
ble HI with observed (expected) values for the Bayesian 
method of 1.04 (0.92) at m H = 160 GeV, 0.93 (0.87) at 
m H = 165 GeV, and 1.26 (1.04) at m H = 170 GeV. We 
use piecewise linear interpolations to display the combi- 
nation results in Figs. [2HU and to quote the observed 
and expected excluded mass ranges. We exclude the 
SM Higgs boson in the mass range 162 to 166 GeV. The 
Bayesian calculation, chosen a priori, was used for this 
exclusion. The corresponding expected exclusion, from 
159 to 169 GeV, encompasses the observed exclusion. 
The CL S calculation yields similar results, as shown in 
Fig. |4j The 1-CL S distribution, which can be directly 
interpreted as the level of exclusion of our search, is dis- 
played as a function of the Higgs boson mass. For in- 
stance, our expected limit shows that in the absence of 
signal the median 1-CL S value with which we expect to 
exclude a SM Higgs boson of mass 165 GeV is 97%. 

In summary, we present the first combined Teva- 
tron search for the SM Higgs boson using the 
H^tW + W~ decay mode. No significant excess of can- 
didates is found above the background expectation for 
130<m#<200 GeV. We exclude the mass range from 
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FIG. 3: (color online). Observed and expected (median, for 
the background-only hypothesis) 95% C.L. upper limits on 
SM Higgs boson production. The shaded bands indicate the 
68% and 95% probability regions in which 7?ii m is expected 
to fluctuate, in the absence of signal. The limits displayed in 
this figure are obtained with the Bayesian calculation. 
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FIG. 4: (color online). Distribution of 1-CL S as a function 
of the Higgs boson mass obtained with the CL S method. The 
shaded bands indicate the 68% and 95% probability regions 
in which the LLR is expected to fluctuate, in the absence of 
signal. 

162 to 166 GeV at the 95% C.L. This is the first di- 
rect constraint on the mass of the Higgs boson beyond 
that obtained at LEP. 
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TABLE I: Ratios, Ru m , of the median expected and observed 95% C.L. limits to the SM cross section for the combination of 
CDF and DO analyses as a function of the Higgs boson mass in GeV, obtained with the Bayesian (upper) and the CL B (lower) 
methods. 



Bayesian 


130 


135 


140 


145 


150 


155 


160 


165 


170 


175 


180 


185 


190 


195 


200 


Expected 
Observed 
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